Minimum risk methods in the estimation of unknown sparsity
نویسنده
چکیده
The selection of significant components in a sparse vector, directly observed with i.i.d. observational noise, typically proceeds by thresholding the observations. The objective in this paper is to choose the threshold that minimizes the risk (expected squared prediction error) of the estimator with respect to the noise-free sparse vector. The risk as a function of the model size (or, equivalently, as a function of the threshold value) is estimated by Stein’s Unbiased Risk Estimator (SURE) and by Generalized Cross Validation (GCV). Minimization of these criteria leads to additional randomness, generally not taken into account. A correction for this minimization randomness is proposed. The analysis starts from the observation that soft-thresholding data with normal noise allows unbiased risk estimators. The step towards hardthresholding is then generalized to other noise models and — through a link between SURE and Mallows’ Cp — to other selection criteria. The case of Akaike’s Information Criterion (AIC) is further elaborated. In a second theme of this paper, the use of GCV for sparse model selection is motivated by two new theorems, one for the asymptotic behavior near the optimal threshold, the other for the asymptotic behavior for thresholds close to zero. A third theme of this paper is the application of GCV to iterative threshold procedures for estimating sparse vectors from indirect observations, such as in inverse problems, compressed sensing and so on. We propose to iterate with a step-dependent threshold and find experimentally that this allows closer fit than any fixed threshold. keywords sparsity, thresholding, wavelet, model selection, Mallows’ Cp, Stein Unbiased Risk Estimation, generalized cross validation, robust AMS 2000 subject classification Primary 62J07,62J02; secondary 62G08
منابع مشابه
A Sharp Sufficient Condition for Sparsity Pattern Recovery
Sufficient number of linear and noisy measurements for exact and approximate sparsity pattern/support set recovery in the high dimensional setting is derived. Although this problem as been addressed in the recent literature, there is still considerable gaps between those results and the exact limits of the perfect support set recovery. To reduce this gap, in this paper, the sufficient con...
متن کاملValue at Risk Estimation using the Kappa Distribution with Application to Insurance Data
The heavy tailed distributions have mostly been used for modeling the financial data. The kappa distribution has higher peak and heavier tail than the normal distribution. In this paper, we consider the estimation of the three unknown parameters of a Kappa distribution for evaluating the value at risk measure. The value at risk (VaR) as a quantile of a distribution is one of the import...
متن کاملجهت یابی چند گوینده با استفاده از روش WCSSDOA
In this paper we propose the spatial sparsity based WCSSDOA method for multi speakers' Direction of arrival estimation. In the proposed method the sparse modeling is done based on the sensor signals' correlation matrix, which leads to low computational complexity. In this method the SVD decomposition of the noise covariance matrix is proposed to reach the free noise sparse model, which leads to...
متن کاملA Novel DOA Estimation Approach for Unknown Coherent Source Groups with Coherent Signals
In this paper, a new combination of Minimum Description Length (MDL) or Eigenvalue Gradient Method (EGM), Joint Approximate Diagonalization of Eigenmatrices (JADE) and Modified Forward-Backward Linear Prediction (MFBLP) algorithms is proposed which determines the number of non-coherent source groups and estimates the Direction Of Arrivals (DOAs) of coherent signals in each group. First, the MDL...
متن کاملMultivariate geostatistical estimation using minimum spatial cross-correlation factors (Case study: Cubuk Andesite quarry, Ankara, Turkey)
The quality properties of andesite (Unit Volume Weight, Uniaxial Compression Strength, Los500, etc.) are required to determine the exploitable blocks and their sequence of extraction. However, the number of samples that can be taken and analyzed is restricted, and thus the quality properties should be estimated at unknown locations. Cokriging has been traditionally used in the estimation of spa...
متن کاملImproved Channel Estimation for DVB-T2 Systems by Utilizing Side Information on OFDM Sparse Channel Estimation
The second generation of digital video broadcasting (DVB-T2) standard utilizes orthogonal frequency division multiplexing (OFDM) system to reduce and to compensate the channel effects by utilizing its estimation. Since wireless channels are inherently sparse, it is possible to utilize sparse representation (SR) methods to estimate the channel. In addition to sparsity feature of the channel, the...
متن کامل